Classifying the socio-situational settings of transcripts of spoken discourses
Abstract
In this paper, we investigate the automatic classification of the socio-situational settings of transcripts of spoken discourse. Knowledge of the socio-situational setting can be used to search for content recorded in a particular setting or to select context-dependent models, for example in speech recognition. The subjective experiment we report on in this paper shows that people correctly classify 68% of the socio-situational settings. Based on the cues that participants mentioned in the experiment, we developed two types of automatic socio-situational setting classification methods: a static socio-situational setting classification method using support vector machines (S3C-SVM), and a dynamic socio-situational setting classification method applying dynamic Bayesian networks (S3C-DBN). Using these two methods, we developed classifiers applying various features and combinations of features. The S3C-SVM method with sentence length, function word ratio, single occurrence word ratio, part of speech (POS) and words as features results in a classification accuracy of almost 90%. Using a bigram S3C-DBN with POS tag and word features results in a dynamic classifier that obtains nearly 89% classification accuracy. The dynamic classifiers not only achieve results similar to those of the static classifiers, but can also track the socio-situational setting while processing a transcript or conversation. On discourses with a static socio-situational setting, the dynamic classifiers need only the initial 25% of the data to achieve a classification accuracy close to the accuracy achieved when all data of a transcript is used. © 2013 Elsevier B.V. All rights reserved.
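The three surface features named in the abstract (sentence length, function word ratio, single occurrence word ratio) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name `surface_features`, the tiny `FUNCTION_WORDS` list, and the exact definitions of the ratios (function-word tokens over all tokens, and word types occurring exactly once over all tokens) are assumptions made for illustration only.

```python
from collections import Counter

# Hypothetical, deliberately small function-word list (an assumption;
# the list used in the paper is not given in the abstract).
FUNCTION_WORDS = {
    "the", "a", "an", "of", "in", "on", "and", "or", "but",
    "to", "is", "it", "that", "i", "you",
}


def surface_features(sentences):
    """Compute three transcript-level features from tokenized sentences.

    `sentences` is a list of sentences, each a list of word tokens.
    The ratio definitions below are assumptions, not the paper's.
    """
    tokens = [tok.lower() for sent in sentences for tok in sent]
    if not tokens:
        # Empty transcript: return zeros rather than divide by zero.
        return {
            "mean_sentence_length": 0.0,
            "function_word_ratio": 0.0,
            "single_occurrence_ratio": 0.0,
        }
    counts = Counter(tokens)
    n_function = sum(1 for tok in tokens if tok in FUNCTION_WORDS)
    n_single = sum(1 for c in counts.values() if c == 1)
    return {
        # Average number of tokens per sentence.
        "mean_sentence_length": len(tokens) / len(sentences),
        # Share of tokens that are function words.
        "function_word_ratio": n_function / len(tokens),
        # Word types seen exactly once, relative to the token count.
        "single_occurrence_ratio": n_single / len(tokens),
    }


# Toy two-sentence transcript, invented for this example.
transcript = [["well", "I", "think", "it", "is", "fine"], ["yes", "it", "is"]]
features = surface_features(transcript)
print(features)
```

A feature dictionary of this kind could be combined with POS and word features and passed to any SVM implementation; the abstract does not say which toolkit or kernel was used.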
Similar resources
Core Units of Spoken Grammar in Global ELT Textbooks
Materials evaluation studies have constantly demonstrated that there is no one fixed procedure for conducting textbook evaluation studies. Instead, the criteria must be selected according to the needs and objectives of the context in which evaluation takes place. The speaking skill as part of the communicative competence has been emphasized as an important objective in language teaching. The pr...
Death risk classifying in patients with internal medical emergencies in pre-hospital settings
Introduction: In a pre-hospital emergency, identifying high-risk medical patients and making appropriate decisions is very important. Classifying life-threatening risks in pre-hospital settings can improve the decision-making process. This study aimed to classify the risk level of death in patients in pre-hospital emergency settings. Materials and Methods: This study was a descript...
The Impact of Translation in Emerging Iran’s Political–Religious Intellectual Discourses and Socio-Cultural Changes (From Early Qajar Dynasty to the End of Reza Shah Era)
Wide-ranging sociological studies have been conducted on the history of Iranian intellectuality and modernism. The findings jointly acknowledge that, owing to communication with the West and the effects received from modernism, the first generation of Iranian intellectualism emerged. It is said that having benefited from the translated works in its general concept, i.e. incl...
The Role of Disfluencies in Topic Classification of Human-Human Conversations
We investigate the impact of disfluencies on the task of classifying natural human-human conversations into topics. Disfluencies are distinctive to spoken language, and their effect on a number of spoken language understanding tasks, including spoken language classification, remains largely unknown. We use a subset of Switchboard-I annotated for disfluencies and topics, and investigate the effe...
Adaptive Language Modeling with a Set of Domain Dependent Models
An adaptive language modeling method is proposed in this paper. Instead of using one static model for all situations, it applies a set of specific models to dynamically adapt to the discourse. We present the general structure of the model and the training procedure. In our experiments, we instantiated the method with a set of domain dependent models which are trained according to different soci...
Journal title: Speech Communication
Volume: 55
Publication date: 2013